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Abstract — In this work we study zero vs. e-error capacity in 
network coding instances. For multicast network coding it is 
well known that all rates that can be delivered with arbitrarily 
small error probability can also be delivered with zero error 
probability; that is, the e-error multicast capacity region and 
zero-error multicast capacity region are identical. For general 
network coding instances in which all sources originate at the 
same source node, Chan and Grant recently showed [ISIT 2010] 
that, again, e-error communication has no rate advantage over 
zero-error communication. 

We start by revisiting the setting of co-located sources, where 
we present an alternative proof to that given by Chan and 
Grant. While the new proof is based on similar core ideas, our 
constructive strategy complements the previous argument. We 
then extend our results to the setting of index coding, which 
is a special and representative form of network coding that 
encapsulates the "source coding with side information" problem. 
Finally, we consider the "edge removal" problem (recently 
studied by Jalali, Effros, and Ho in [Allerton 2010] and [ITA 
2011]) that aims to quantify the loss in capacity associated with 
removing a single edge from a given network. Using our proof 
for co-located sources, we tie the "zero vs. e-error" problem 
in general network coding instances with the "edge removal" 
problem. Loosely speaking, we show that the two problem are 
equivalent. 



I. Introduction 

In the network coding paradigm, internal nodes of the 
network may mix the information content of the received 
packets before forwarding them. This mixing (or encoding) of 
information has been studied extensively over the last decade 
(see, e.g., 0, 0, 0, 0, and references therein). While 
network coding in the multicast setting is well understood, 
far less is know about general network coding. 

This work addresses the potential gap between zero-error and 
small non-zero error (here called "e-error") communication 
in the context of network coding. In the multicast setting, a 
single source node transmits all of its information to a set of 
terminal nodes. In this setting, the zero- and e-error capacities 
are the same, the capacity can be achieved precisely with 
linear codes, and the codes that achieve the capacity can 
be efficiently found 0, 0, 0. Hence, in this setting, 
there is no capacity advantage in relaxing the communication 
requirement and enabling an e > error in communication. 
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For general instances of the network coding problem, in 
which there may be several source nodes transmitting infor- 
mation to different subsets of terminals, it is natural to ask 
whether the same phenomenon persists. When information 
transmitted from different sources is dependent, the answer 
is negative. That is, allowing an e-error can significantly 
increase the achievable rate region, as shown, for example, 
for the Slepian-Wolf problem in 0. In the network coding 
model, however, sources are assumed to be independent. In 
this case, the question of whether there is a rate advantage 
associated with allowing an e-error remains open. 

A. Previous work 

Chan and Grant explore the rate-advantage of e-error com- 
munication over zero-error communication in 0. They use 
the notion of entropic functions (e.g., 0, 0) and their 
connection to the characterization of the network coding 
capacity 0, 0, ifTOl to show that e-error communication 
has no rate advantage over zero-error communication when 
all sources are co-located at a single node. Chan and Grant 
also study the scenario in which the sources are not co- 
located, but there is a super-node in the network that has both 
full knowledge of all the information present at the sources 
and low capacity outgoing edges connecting it with each and 
every one of the source nodes. In this scenario, they show 
that e-error communication again offers no rate advantage 
over zero-error communication [0. 

B. Our contribution 

This work begins with an investigation of the relationship 
between zero- and e-error communication in network coding. 
As in 0, we initially focus on networks with co-located 
sources. For this scenario, we present another proof that e- 
error communication offers no rate advantage over zero-error 
communication. Our proof is constructive: we show how to 
transform any e-error network code into a zero-error code at 
the price of a small loss in rate. As e tends to zero, the rate 
loss also approaches zero. Thus any rate that can be achieved 
with arbitrarily small error probability can also be achieved 
with error probability zero and arbitrarily low rate loss. The 
core ideas in the proof of (7j and our proof are similar; we 
include the proof nonetheless since the approach is central 
to proving the relationship between the e- vs. zero-error 



capacity problem and the "edge removal" problem described 
below. 

After studying co-located sources, we turn our attention to 
the index coding problem ifTTl . which is a special instance 
of the network coding problem that has seen a significant 
amount of interest recently 0j], ED, E], OH, [TJ]. The 
index coding problem captures the problem of "source coding 
with side information" in which a single server wishes to 
communicate with several clients, each having different side 
information. Although index coding simulates a single-source 
communication problem, it does not meet the definition 
of either the co-located source problem or the super-node 
problem described above. Nevertheless, the results for co- 
located sources extend naturally to the index coding setting. 

Finally, we consider the "edge removal" problem introduced 
by Jalali, Effros, and Ho in lfl6l . IfTTl . Here the goal is to 
quantify the loss in capacity that results when a single edge is 
removed from a given network. While the problem is solved 
for a variety of special cases in [16], IfTTl . many more cases 
remain unsolved. In fact, even the capacity consequences of 
removing edges that can carry asymptotically negligible rate 
are understood only in a limited family of scenarios |fl8l , 
|fl9l , lf20l . Loosely speaking, we show that the "zero- vs. e- 
error" problem in general network coding instances and the 
"edge removal" problem are equivalent. Namely, we show 
that quantifying the rate loss in the former problem would 
imply a quantification for the latter and vice-versa. 

For example, as a corollary of our equivalence, we show that 
if removing an edge that can carry asymptotically negligible 
rate (that is, an edge that can carry a number of bits that 
grows sublinearly with the coding blocklength) has vanishing 
effect on the capacity of the network then e-error network 
coding has no rate benefit over zero-error network coding. 
We stress that the former assumption is currently open. 
Our reduction between the two problems is based on our 
alternative proof for the "zero vs. e-error" problem with co- 
located sources mentioned above. 

The remainder of the paper is structured as follows. In 
Section |Tjl we define the model of study. In Section Hm 
we prove our reduction between zero- and e-error network 
coding in the co-located source and super-source settings. In 
Section IIVI we address the index coding problem. Finally, 
in Section [VJ we address the connection between the edge 
removal problem and the e- vs. zero-error capacity problem. 

II. Model 

An instance I = (G, S, T, B) of the network coding problem 
includes a directed acyclic network G — (V, E), a set of 
source nodes S C V, a set of terminal nodes T C V, and 
an IS"! by |T| requirement matrix £?Q We assume, without 
loss of generality, that each source s £ S has no incoming 

1 To be precise, both the set T and the set S should be treated as multisets 
as we allow several sources/terminals to be located at the same node. 



edges and that each terminal i e T has no outgoing edges. 
Let c e denote the capacity of each edge e £ E, and for any 
k > 0, define [k] as [k] — {1, . . . , k}. Then, for any block 
length n, each edge e can carry one of the 2 c " n messages 
in [2 Ce ™]. In our setting, each source s £ S holds a rate 
R s random variable X s uniformly distributed over [2 flsI1 ]. 
The variables of different sources are independent. A network 
code, (F : X) = ({/ e } U {gt}, {X e }) is an assignment of a 
pair (X e , f e ) to each edge e E E, and a decoding function 
{g t } to each t G T. For e = (u, v), f e is a function taking as 
input the random variables associated with incoming edges 
of node u, and X e £ [2 Ce ™] is the random variable equal to 
the evaluation of f e on its input. If e is an edge leaving a 
source node s G S, then X s is the input to f e . The input 
to the decoding function g t consists of the random variables 
associated with incoming edges of terminal t. The output of 
gt is required to be a vector of all sources required by t. 
Given, the acyclic structure of G, the network code (J 7 , X) 
can be defined by induction on the topological order of G. 

The IS" I by |T| requirement matrix B = [bij] has entries in 
the set {0, 1}, with b s j = 1 if and only if terminal t requires 
information from source s. 

A network code (J 7 , X) is said to satisfy node t under 
transmission (x s : s € S) if the decoding function g t outputs 
(x s : b(s,t) = 1) when (X s : s £ S) = (x s : s £ S). 
Network code (J 7 , X) is said to satisfy instance 1 with error 
probability e > if the probability that all t € T are simul- 
taneously satisfied is at least 1 — e. The probability is taken 
over the joint distribution on random variables (X s : s £ S). 
Namely, (T,X) satisfies instance I = (G, S,T, B) with 
error e if 

Pr [V t € T : t is satisfied under (X s : s £ S)} > 1-e 

(X B :s€S) 

An instance X to the network coding problem is said to be 
(e, R, n)-feasible if there exists a network code (J 7 , X) with 
block length n and rate H(X S ) = R (for all s) that satisfies I 
with error < e. An instance I to the network coding problem 
is said to be [e, i?)-feasible if for any 8 > there exists 
a block length n such that I is (e, R(l — 6), n)-feasible. 
Under the e-error communication model, the capacity of an 
instance X refers to the supremum over all rates R that are 
(e, i?)-feasible for all e > 0. Often, the error probability e 
becomes small as the block length n grows sufficiently large. 
Under the zero-error communication model, the capacity of 
an instance 1 refers to the supremum over all rates R that 
are (0, i?)-feasible. 

Some remarks are in place. The given model assumes all 
sources s £ S transmit information at an equal rate R. There 
is no loss of generality in this assumption as a varying rate 
source s can be modeled by several equal rate sources all 
co-located at s. 

In places throughout this work, we explicitly assume that the 
block length n is of sufficiently large size. This is in a sense 
w.l.o.g. given the following claim proven in the Appendix. 



Claim 2.1: Let I be a (e, i?, n)-feasible network coding 
instance. For any integer c > 0, there exists a block length 
n' > cn such that I is also (e, R(l — 5-^/i), n')-feasible. 

III. Our proof for co-located sources 

In this section, we consider instances 1 with co-located 
sources, showing that if 1 is (e, i?)-feasible for any e > 0, 
then it is also (0, i?)-feasible. Our proof is constructive: An 
arbitrary code with error probability e is used to design a 
zero-error code with a negligible rate loss. 

Theorem 1: Let I = (G,S,T,B) be an instance to the 
network coding problem with k sources si,...,Sft all co- 
located at a single vertex in G. If I is (e, i?)-feasible for 
all e > 0, then it is also (0, i?)-feasible. Specifically, for any 
sufficiently large block length n it holds that if I is (e, R, n)- 
feasible it is also 

log(l-e) 2\og{RnY 



0,R 1 



Rn 



Rn 



feasible. 



Proof: Let (J 7 , X) be a network code of rate R and block 
length n that satisfies I with error probability no greater 
than e. Then (J 7 , X) allows the communication of source 
random variables {X s } se s, which are all independent and 
uniformly distributed in [2^™]. Let 8 = 6(e) be a parameter 
to be defined later in the proof. In what follows, we show 
thatl is (0, R(l — S)) -feasible by constructing a new network 
code (J 7 ', X 1 ) of rate R(l — &) and the same block length n. 
Let {Y s } se s denote the (new) source random variables which 
are all independent and uniformly distributed in [2- R ( 1-<5 )"]. 
The new network code (J 7 ', X') enabling the communication 
of the random variables {Y s } uses the exact same network 
coding at internal nodes of the network and essentially the 
same decoding at terminals, the only difference is a pre- 
encoding step at the single source node (which holds all 
information on the realization of {Y s } to be transmitted). 
The ideas governing our pre-encoding are taken from the 
field of point to point channel coding (and especially the 
study of Arbitrarily Varying Channels, e.g., lETI ). where it 
is common to find an equivalence between the notion of 
deterministic coding schemes with small average error and 
stochastic coding schemes with small maximum error. 

To simplify our notation, denote the source random vari- 
ables used in the original network code (J 7 , X) by X = 
(Xi, . . . ,Xk) and the source random variables used in the 
new network code (J 7 ', A"') by Y = (Yi,...,Yfe). Denote 
a realization of X = (Xi, . . . , Xf.) by x = (xi, . . . , x&) 
and a realization of Y = (Yi, . . . , Y k ) by y = (yi, . . . , y*,). 
Let A(x) be a function with range {0, 1} that captures the 
success or failure of the original communication protocol. 
Specifically, A(x) = 1 if and only if the original protocol 
fails on realization x of X. Here, the exact notion of "fail" 
is of little significance. The analysis that follows only relies 
on the fact for X drawn uniformly at random over [2 Rn ] k , 

PrL4(x) = !]<£. 



We now construct the pre-encoding phase of our new commu- 
nication protocol (J 7 ', X 1 ). Our pre-encoding ties the source 
information y with a certain realization x of 1. The new 
network code (J 7 ', X') first maps the source information y 
to its corresponding x. Then it proceeds using the encoding 
functions specified by the original network code (J 7 , X) (with 
x as the source information). Finally, decoding is done in 
two phases: first the terminals decode using the decoding 
functions from the original code (J 7 , X) to obtain their 
relevant entries of x, and then they reverse the pre-encoding 
to obtain the corresponding entries of y. 

Our pre-encoding is based on a "random binning" argument 
and is done in two steps. First, for each source i we partition 
the set [2 Rn ] into 2 { - 1 ~ s ^ Rn groups, each of size 2 5Rn . Denote 
the partition for source i by P % = P{ , . . . , Po(i-5)«n ■ Roughly 
speaking, partition P l corresponds to random variable Yi, and 
each realization y^ corresponds to a certain set P^. in P l . 

Formally, to define the pre-encoding, we would like to tie 
each realization y = yi , • • • , y& to a certain realization x = 
Xi, . . . , Xfe to be communicated over the network (using the 
original protocol). Each Xj belongs to the set corresponding 
to y^ namely x, S P^.. (Recall that we view y; as an integer 
in [2^ s ^ Rn ].) The k sources are encoded jointly in such a 
way that A(x) = (that is, x does not cause a decoding 
error). This ensures that if we use the original communication 
protocol on realization x then the terminals can successfully 
recover the entries x^ that they require. Now each terminal 
can just check which realization y; corresponds to x^ (that 
is, find the realization y^ such that x^ G Py.) and in such a 
way decode y^. 

It is left to specify how the partitions are defined and what 
governs our mapping between realizations y and x. The 
partitions are chosen uniformly at random (and indepen- 
dently from each other). Now, for the mapping, consider 
a set of partitions P = (P 1 ,...,P k ); partition P l = 
(P 1 V..,P 2 i (1 _ ii ) R „) of alphabet [2 nR ] is used in the code 
for Yi. We say that P is good with respect to a realization 
y = (yi, . . . , y fe ) if the product set P'xP^x-x P k k 
contains a realization x = (xi, . . . , x^) such that A(x) = 0. 
Indeed, if this is the case, we may map y to x and 
communicate x without error over the network. It is left 
to show that a random set of partitions P is good for all 
realizations y (with some positive probability). In this case 
we say that P is good. Lemma [3T1 below, shows that when 
5 = 6(e) = - log ^~ E) + 21o g, (Rn) and A is the uniform 



distribution over all possible partitions of the given size, 
PrApP is good] > i. This suffices to conclude our proof 
since it proves the existence of a good set P, and the existence 
of a good set P implies a sufficient pre-encoding scheme. To 
prove that an instance 1 which is (e, i?)-feasible for all e > 
is also (0, i?)-feasible, we may use Claim |2~TI ■ 



Lemma 13.11 is an intermediate result used in the proof of 
Theorem Q] above. This result bounds the probability that a 
partition chosen uniformly at random from all partitions of 



the right size is "good" in the sense that every cell of the 
partition contains at least one element that can be decoded 
correctly by a given code. 

Lemma 3.1: Let n be sufficiently large. Let 

logO^) 21og(tfn) 
17 Rn Rn 

Let A be the uniform distribution over sets of partitions P. 

Pr[P is good] > -. 

Proof: Let the term bad be the complement of good. 
The proof works to show that 

Pr[P is bad for yl < - • 2- {1 ~ s)kRn 
A 2 

for any given y = (yi, . . . , yj.). We then obtain our assertion 
by the union bound over the 2^ 1 ~ 5 ^ >kRn values of y. 

Recall that the event 

"P is bad for y" 
is (by definition) exactly the event 

"Vx 6 P y \ x P y 2 2 x ■ ■ • x P y \ : A(x) = 1". 

When X is drawn uniformly at random, A (X) = 1 with 
probability at most e. Thus, as a mental experiment, if one 
would assume that for random P the values of x G P yi x 
P y9 x • • • x P k k are uniformly and independently distributed, 
then one would have 



Pr[ 



is bad for y] < e 



which would more than suffice for our needs. However, as the 
reader surely noticed, we are not in the setting of this mental 
experiment as there are dependencies between the different 



x in P y \ x P 2 2 x 



x Py k . In what follows we show that 



we are, nevertheless, not far from this scenario. 



Py. for each 



To simplify the notation, fix y, and let P 
i. Note that when P is chosen uniformly at random, the sets 
{P l } k =i are uniformly and independently distributed subsets 
of size 2 5Rn of [2 Rn ]. Denote the 2 SRn elements of P l as 

P l = {X^, . . . , X^gRn }■ 

While the choice of any two elements x* and x*, in cell p 
for source i are dependent the choice of any two elements 
x* and x*, for distinct sources i ^ i' are independent. 

To obtain our bounds, we analyze an event that has proba- 
bility greater than the event that we want to bound. Namely, 
we study 



diagonal elements.) Since each entry x* appears only once 
in this set, this restriction gives us the independence we need 
to simplify our analysis. 

Recall that partition cell P % and its elements {x*-}|J^* are 
random variables governed by the distribution A. Given any 
jo G [2 5Rn ] consider any realization of the random variables 
x* for all i and j =^ jo; denote this realization by TZj a . 
Claim 13.11 proved below, shows that 



Pr[A(x) oJ ...,x^) = l|^ ]<(l-2 



-Kn(l-<5)_ 



) — Rn\ — k 



Thus, by the chain rule, the intersection (over j) of the events 
"A(xj,x 2 , . . . ,x^) = 1" has probability no greater than 



(1 _ 2-fl«(i-<5) -|- 2- Rn ) k 

We next complete the proof by showing that this value is less 
than 2~( 1 ~ <5 ) fc/? "~ 1 for the <5 defined in the lemma statement 
and sufficiently large n. 

For ease of presentation, we introduce a new parameter a 
defined by a = ~'°| ( t |~ e) or equivalently e = 1 - 2~ aRn . 
In what follows we assume that n is sufficiently large such 

21og(fln) 
Rn 

the lemma statement. 



that a < 1/3. Now let 5 = a + " lu ^" ; be defined as in 



Note that 



Y t^ — aRn 



(1 - 2- aRn f Rn 



< 



(l _ 2-Rn(l-S)^k2 6R " 
aRn\2 SRn 



< 2e(l - 2- aHn ) 

where the last inequality follows since 5 is strictly less than 
1/2 and thus for sufficiently large values of n: 

(1 _ 2 -fl B (l_i)jfc2«- > (1 _ 2 -ft l( l-A'))2^^- 5 > > 1/(2e) _ 

It now suffices to show that 



aRn\2 SRn 



(1 - 2~ aKn ) 



< 2 



-kR.n 



1 

4e 



-(l-S)kRn 



The right most inequality follows from the fact that we are 
taking sufficiently large values of n and by the fact that S is 
bounded away from 1 . For the left inequality, taking In, and 
using the fact that ln(l — x) < —x, we have: 



-\5Rn 



ln(l - 2- aRn ) < - (2 Rnt - s - a ^ , 



which is less than ln(2- feKn ) for S - a > 2 '°^ ( t f" ) (for 



Pr [Vj = 1, . . . , 2 6Rn : A(x], . . . ,xf ) = l] . (1) sufficiently large n). This concludes the proof. 



Notice that the above equation does not treat the probability 
that A(x) = 1 for all (2 5Rn ) k values of x in P 1 x P 2 x 
• • • x P k . Rather, it restricts attention to 2 SRn elements 
(xj, . . . , Xj) G P 1 x • • • x P k . (We refer to these elements as 



Claim [3TTI used in the proof of Lemma I3TTI above, bounds the 
probability that the given code fails for the vector containing 
the jo'th element of each partition cell Pj when all other 
elements of {Pj}^ =1 are fixed. 



Claim 3.1: Let {P l }i£[k] = {P yi }ie[k] be the k par- 
tition cells corresponding to an observed source vector 
(yi , ■ • ■ , Yfc)- For each i, let {x| , . . . , x* 5Rn } denote the 2 5Rn 
elements of P l . Given any jo G [2 5i?n ], fix the realization x] 
for all e [k] x ([2 SRn ] \ {jo})- Denote this realization 
by TZj . Then 



,x) o ) = 1 | ft jo ] < (i_2- K "( 1 -^+2- R ")- fc e 



Proof: By our definition of the random variables {x*}, 
for any realization lZj the variable xj o is uniformly dis- 
tributed in [2 Rn ] \ {^)}jjtj - Thus, the vector x,* = 
x* 5 is uniformly distributed in a subset of 



30 ' 3a ' 



[2 An ]^"of size T = {2 Rn - 2 SRn + l) k = 2 Rnk {l - 
2 -Rn(i-S) + 2 -Rn-)k As A (^j = i only on an e fraction of 
x G [2 Rn ] k , we conclude that the probability that A(5c) = 1 
when x is uniform in any subset of size greater than T is at 
most 2 Rnk e/T. This concludes our assertion. ■ 

Remark 3.1: Essentially the same argument (of Theorem Q3 
can be applied to the scenario in which the sources are not co- 
located but there is a super-node that knows all of the source 
information {y s } se s and has links to all sources s £ S with 
capacity which asymptotically (in n) tends to zero. In this 
case, for every y, the super-node computes the pre-encoding 
x = (xi, . . . jXfc) and sends to source i the location of x^ 
in Py. . This information can be transmitted using capacity 
SR links. Notice that by the analysis of Theorem Q] one may 
always take the location of x^ in P y . to be identical for all 
i (as we analyzed the diagonal event in Equation (Q])). This 
fact is used in Section [V] As source i knows y^, and the good 
P is known at all source nodes (and the super-node) - once 
each source knows the location of x^ in Py., it can transmit 
x; as desired. 

IV. e-ERROR VS. ZERO-ERROR FOR "INDEX CODING" 

In this section we study special instances X to the network 
coding problem known as index coding instances. We show 
that for these special instances, one can prove a theorem 
similar to Theorem[T]even though the sources of index coding 
instances are not co-located. 

We begin with a definition of the instances X = (G, S, T, B) 
corresponding to the index coding problem. The set S is 
a set of k sources {si,...,Sfc}. The set T is a set of k 
terminals {t\, . . . The graph G consists of the vertices 
{sx, ■ • ■ , Sfc} and {ti, . . . , tk] and two special vertices u and 
v. The edge set of G consists of an edge (si,u) from each 
source node Si to the bottleneck input node u, an edge (u, v) 
which is the network bottleneck, an edge (v,tj) from the 
bottleneck output v to each terminal node tj, and a collection 
of side information edges (si,tj) directly from source Si to 
receiver tj for some subset E S id e C S x T of source-terminal 
pairs. All edges are of capacity 1. The requirement matrix B 
and the side information edges E S id e characterize the index 
coding instance. The index coding problem encapsulates the 



"source coding with side information" problem in which a 
single server wishes to communicate with several clients, 
each having different side information. 

Now that we have defined index coding instances, it may be 
clear to the reader why the proof of Theorem Q~]should extend 
to these instances as well. In index coding, encoding is done 
only at the bottleneck input node u, and node u has access 
to all of the source information {Jf s } s6 s. We formalize this 
intuition below. 

Theorem 2: Let X be an instance of the index coding prob- 
lem with k sources si,...,Sfe. If X is (e, i?.)-feasible for 
all e > 0, then X is also (0, i?)-feasible. Specifically, for 
any sufficiently large block length n, it holds that if X is 
(e, R, n) -feasible it is also 



0,i? 1 



log(l-e) 2log(Rn) 



Rn 



Rn 



. n 



feasible 



for a slightly larger block length n' of size n' = n — log(l — 
e) + 21og(i?n). 

Proof: As mentioned above, the proof follows the line 
of proof given in Theorem Q] To obtain a zero error network 
code (J 7 ' ,X') from an e error code (T,X), one performs 
a pre-encoding step at node u (which has knowledge of all 
source symbols) and uses the ideas specified in Remark |3TI 
to allow decoding. Specifically, using the notation of Theo- 
remQ] each source s; sends its information y^ on its outgoing 
edges. Node u, after receiving y, uses the pre-encoding 
procedure and obtains x. Using x and the original network 
code (J 7 , A"), u determines z the transmitted message on 
its outgoing edge (u, v). In addition, u acts as the super- 
node in Remark 13.11 and appends to z the (single) index 
specifying for all i the location of x^ in P y .. The fact that 
we are appending additional information of rate 5Rn = 
— log(l — e) + 2 log(i?n) to z is possible as the new network 
code has block length n' =n — log(l — e) + 21og(i???). 

For decoding, terminal tj receives the message z, the mes- 
sages yi from edges (si,tj) £ E S id e , and the location of Xj 
in P y . for each such edge (si,tj) £ E S id e - Using this infor- 
mation, tj can reconstruct x.; for each edge (si,tj) £ E S id e 
and thus use the decoding scheme of the original network 
code (J 7 , X) to obtain any source information x^ it requires. 
Finally, tj can invert the pre-encoding to obtain the messages 
yi it requires. ■ 

V. Connection to the "edge removal" problem 

In this section, we discuss connections between the question 
of zero- vs. e-error network coding capacities and the ques- 
tion studied in lfl6l . ifTTl addressing the maximum change in 
capacity that can result when a single edge is removed from a 
network. Namely, we consider the following two propositions 
and show that they are equivalent. 

Proposition 5.1 (Error reduction): Let e = e(n) > 0. Let 



a = a(n) 



_ -log(l-e) 



so e = 1 - 2- 



Let e' £ [0,1/2] 



There exists a universal constant c\ such that any instance 

X = (G,S,T,B) that is (e' + (1 - e')(l - 2- an ),R,n)- 
feasible is also (e', R — c\a, n)-feasible. 

Proposition 5.2 (Edge removal): Let X = (G, S, T, B) be an 
instance of the network coding problem. Let e £ G be an 
edge of capacity a. Let X' = (C , S,T, B) be the network 
coding instance obtained by replacing G with the network 
G in which edge e is removed. Let e' £ [0,1/2]. If X 
is (e', R, n) -feasible then X' is (e',i? — C2a, n)-feasible for 
some universal constant C2. 

We note that in Proposition 15.11 a network code's error 
parameter e may be a function of the code's block length n. 
Thus both propositions are stated explicitly with the block 
length parameter n. In addition, Proposition 15.11 slightly 
generalizes the "zero- vs. e-error" problem to the problem 
of "error-reduction," in which we seek to show that an 
(e' + (l — s')e, R, n)-feasible instance is also (e 1 , R~8,n) for 
a suitable S = S(e). Here, both e and e' are error parameters, 
the initial error term is expressed as e' + (1 — e')e (which 
implies that the error term is always less than or equal to 1), 
and we seek to reduce the error from e' + (1 — e')e to e'. 
When e' = 0, this is the familiar "zero- vs. e-error" problem; 
we here treat the general case. 

We now show that Proposition 15.11 holds if and only if 
Proposition 15.21 holds. Specifically we present two theorems 
below (one for each direction). 

Theorem 3: Proposition 15.11 with parameter c\ implies 
Proposition 15.21 with parameter C2 equal to c\. 

Proof: Let I = (G, 5, T, B) be an instance to the 
network coding problem. Let e G G be an edge (of capacity 
a). Let X' = (G' , S,T, B) be the network coding instance 
obtained by replacing G with the network G' in which the 
edge e of capacity a is removed. Let X be (e' , R, n) -feasible, 
and consider the corresponding network code (X, X). As 
studied in fl6l . ifTTl . consider the value £ [2 an ] trans- 
mitted on e for each and every setting of source information 
x = (xi, . . . ,Xfe) that results in correct decoding. Here, as 
before, we take x.; £ [2 Rn ]. By an averaging argument, there 
exists a value y £ [2 an ] such that 

P^iUx — U I x results in correct decoding] > 2~ an . 

X 

We construct a new code (J 7 ', X') fori' which equals (X, X) 
on all functions except the functions corresponding to edges 
leaving head(e). These changed functions use the fixed value 
y as input instead of the value X e = in the original code 
(X, X) for X. As (J 7 ', X') is identical to (J 7 , X) when y^ = 
y, it holds that (J 7 ', X') is a blocklength-?i code with rate R 
and error probability at most e' + (1 -e')(l - 2~ Q ™). Thus X' 
is (e' + (1 - s')(l - 2- Q "), R, n)-feasible. If Proposition O 
holds, then this implies that I' is (e',R — c±a, n)-feasible. 
Thus Proposition 15.21 follows, with C2 = c\ , ■ 

Theorem 4: Proposition 15.21 with parameter c 2 implies 
Proposition 15.11 with parameter ci = c 2 + 1 + 2 lo ^f — 



given that (in Proposition 15.1b n is sufficiently large and 

a(n) = " lo f- £) < 1/3. 

Proof: Below, we consider the case in which e' = 0. 
A similar analysis also holds for e 1 > (see remark located 
at end of proof). Let I = (G, S, T, B) be an instance to the 
network coding problem that is (1 — 2~ Q " , R, n)-feasible. We 
show that I is also (0, R — cia, n)-feasible. 

We consider 2 additional instances T\ = (Gi, S±,T, R) and 
I2 = (G2, S2,T, R) similar to those considered in Q. We 
start by defining the network G2; network Gi is then obtained 
from network G2 by a single edge removal. 

Network G2 is obtained from G by adding k new source 
nodes s[, - ■ ■ ,s' k , a new "super-node" s, and a relay node 
r. For each s; 6 G, there is a capacity-i? edge (s^, Si) from 
new source to old source Si. For each £ G2, there is a 
capacity-i? edge (s^, s) from new source to the super-node 
s. There is a capacity-^ edge (s,r) from the super-source s 
to the relay r; this edge is the network bottleneck and the 
bottleneck capacity S equals a + Hi£SX5") . (Notice that the 
value of 5 is set to satisfy the requirements in Theorem Q] as 
we have normalized by R.) Finally, the relay r is connected 
to each source node Si by an edge (r, Si) of capacity 8. The 
new source set S2 is {s[, . . . , s' k }. For Xy, we set S\ = S2, 
and remove the bottleneck edge (s,r) of capacity <5. 

We prove the desired result by demonstrating the following 
properties: 

(a) Instance I2 is (1 — 2~ Q ™, R, n) -feasible. 

(b) Instance I2 is also (0, R — a — 2 ]2SLEsH ) ; n)-feasible. 

(c) Instance X x is (0, R - a - 21og ra (to) - c 2 a), n)-feasible. 

(d) Instance X is also (0, R — c±a, n)-feasible for c\ — C2 + 

1 , 21og(fln) 
an 

The proof of (a) follows from our construction since X is (1 — 
2 _Q ™, R, n)-feasible by assumption. The proof of (b) follows 
by applying Theorem [T] (or more specifically Remark |3Tt to 
X2. The proof of (c) follows by removing edge (s, r) from 
G2 to obtain Gi, and then applying Proposition 15.21 Finally, 
for (d), we note that by our construction, any code (X, X) 
that is feasible for Xi is also feasible for X. 

For e' £ (0, 1/2], in (b) above we may reduce the error from 
e' + (1 - s')(l - 2- an ) = 1 - (1 - e')2- an < 1 - 2~ an - 1 
to via Theorem [T] by considering a + 1/n instead of a. 
Modifying the proof of Theorem Q] slightly, this also implies 
a value of ci = c 9 + 1 + 2 IgsCggO as s tated in the assertion. 

± z an 

■ 

We note that the reduction above implies in particular that: 

Corollary 5.1: If for capacities a that vanish in the block 
length (i.e., a = o(l)) Proposition 15.21 holds with c 2 such 
that c 2 a = o(l) then a network coding instance X which is 
(e, i?)-feasible for all e > is also (0, i?)-feasible. 

It is interesting to point out that connections similar to those 
of Corollarv l5.1l also exist between the edge removal problem 



for vanishing a and the strong converse problem studied in 
EH), lfl9l . Il20l . This forges an intriguing connection between 
the three problems. 

VI. Concluding remarks 

In this work we have studied the potential gain in allowing 
e-error communication when compared to zero-error com- 
munication in the network coding scenario (where source 
information is independent). For the setting of co-located 
sources (and also that of index coding) we present an 
alternative proof to that of Chan and Grant Q, which allows 
us to prove an equivalence with the edge removal problem 
of lfl6l . ifTTl . Both the capacity loss in the edge removal 
problem, and the potential gain in capacity when allowing 
an e > error in network communication remain open in 
this work. Nevertheless, our equivalence shows that there is 
no gain in e- vs. zero-error communication if one can prove 
that the removal of an edge of low (vanishing) capacity has 
low (vanishing) effect on the communication capacity of the 
network at hand. 
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Appendix 

A. Proof of Claim\2j] 

Proof: Roughly speaking, the proof is obtained by 
applying a standard argument in which one uses the orig- 
inal (e,R,n) coding scheme over multiple time instances 
combined with a carefully chosen outer code. 

We start by setting some notation. Consider the original 
(e, R, n) communication protocol. Let the source random 
variables be X\,...,Xk; each Xi uniform in [2 Rn j. For 
source realization x = Xi,... , x^, let A(x) be a function 
with range {0,1} that captures the success or failure of 
the original (e, R, n) communication protocol. Specifically, 
A(5t) = 1 if and only if the original protocol fails on 
realization x of X. For c' > c and a rate R 1 to be specified 
shortly, we now consider an (e, R', c'n)-feasible communi- 
cation protocol obtained by applying the original protocol 
over c' time instances (to obtain total block length c'ri). 
Namely, let Y = Y x , . . . ,Y k , with Fj uniform in [2 R ' c ' n ], 
be the new source information, and y = yi, • • • ,yfc denote 
its realization. Consider an encoding Ci : [2 R c n ] — > [2 Rn ] c 
for each (new) source i € [k]. For an input to Ci let 
Ciijfi) = x? = Xjj, . . . , Xi. C ' be the encoding of realization 
Yi. Here, for each pair i, j it holds that x. ; j E [2 Rn ]. 

The new protocol has the following natural structure: the 
source input is first encoded (at each source) to obtain 
x^ . The encoded source information x^ = x^i, . . . , Xi )C ' 
is now transmitted over the network using the original 



protocol over c! rounds of communication. The terminal 
node t (after receiving the information of each and every 
communication round) first uses the original protocol to 
decode a (possibly corrupted) version z\ = z^i, . . . , Zj iC / 
of x^ = Xi i, . . . , Xj jC '; and then uses the error correcting 
capabilities of code Ci to obtain x^ = x^x, . . . , x, iC ' and 
thus y; (with high probability). 

We now analyze the new (block length c'n protocol). Recall, 
that for a random x € [2 Rn ] k , it holds that A(x) = 
with probability at least 1 — e. This implies that for random 
inputs {x^ }i = {xi i, . . . ,Xi C /}i it holds that the expected 
Hamming distance between z\ and x£ (defined above) is at 
most ec 1 . Using the Chernoff bound, we conclude for random 
{x|'}i that 

Pr[Vz = 1, . . . , k : ||xf - zf\\ H < 2ec'} > 1 - k2~ n ^ EC '\ 

(2) 

Now, for each i, consider the code d obtained by taking 
any rate r' = R' /R code of minimum distance d! = 4ec' + 1 
over the alphabet [2 Rn ] and applying an independent random 
permutation on each of its c' coordinates. 

We prove that with high probability over the random per- 
mutations defining {Ci}i, it holds that with probability at 
least 1 — e over the source information {Yi} that indeed 
||x£ — z\ || # < 2ec'. As we will show, the above assertion 
will essentially suffice to prove the claim. For the assertion, 
notice that for any y = yi, . . . , the corresponding values 
in {Ci(yi)}i = {xf }i = {x i; i, . . . ,x^ c /}i are all indepen- 
dent and uniformly distributed in [2^™]. Thus, by Equation [2] 
the expected number (where the expectation is taken over the 
permutations defining the codes {Ci}i) of source realizations 
y for which the corresponding {x° }i and {zf }i satisfy 
Vi : ||x?' - zf\\ H < 2ec' is at least 2 R ' c ' nk (l - fc2-°( £C ')). 
This implies the existence of a set of permutations (and 
correspondingly a set of codes {Ci}i) for which the number 
of source realizations y for which the corresponding {x? }i 
and {z? }i satisfy Vi : || — z? < 2ec' is at least 
2 R'c'nk(i _ fc2-°( ec ')). Taking d large enough such that the 
term k2~ a< > £C ' in Equation |2] is at most e, and using the 
fact that the codes Ci all have minimum distance 4ec' + 1, 
we conclude that the new protocol is indeed (e, R' , de- 
feasible. It remains to specify the value of r' (and thus that 
of R' = Rr'). 

The rate r' is set to be the highest rate for which there exist 
codes of block length d and minimum distance d' = 4ec' + 
1 over alphabets of size 2 Rn . Using the Gilbert- Varshamov 
bound |22), J23] we can set r' > 1 - H 2 n n (Ae + 1/d) > 
1 — i?2(4e + 1/c') > 1 — for large enough values of c'. 
Here, H q denotes the g-ary entropy function. ■ 
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